56 research outputs found

    Genetic Variations and Diseases in UniProtKB/Swiss-Prot: The Ins and Outs of Expert Manual Curation.

    Get PDF
    During the last few years, next-generation sequencing (NGS) technologies have accelerated the detection of genetic variants resulting in the rapid discovery of new disease-associated genes. However, the wealth of variation data made available by NGS alone is not sufficient to understand the mechanisms underlying disease pathogenesis and manifestation. Multidisciplinary approaches combining sequence and clinical data with prior biological knowledge are needed to unravel the role of genetic variants in human health and disease. In this context, it is crucial that these data are linked, organized, and made readily available through reliable online resources. The Swiss-Prot section of the Universal Protein Knowledgebase (UniProtKB/Swiss-Prot) provides the scientific community with a collection of information on protein functions, interactions, biological pathways, as well as human genetic diseases and variants, all manually reviewed by experts. In this article, we present an overview of the information content of UniProtKB/Swiss-Prot to show how this knowledgebase can support researchers in the elucidation of the mechanisms leading from a molecular defect to a disease phenotype

    An enhanced workflow for variant interpretation in UniProtKB/Swiss-Prot improves consistency and reuse in ClinVar.

    Get PDF
    Personalized genomic medicine depends on integrated analyses that combine genetic and phenotypic data from individual patients with reference knowledge of the functional and clinical significance of sequence variants. Sources of this reference knowledge include the ClinVar repository of human genetic variants, a community resource that accepts submissions from external groups, and UniProtKB/Swiss-Prot, an expert-curated resource of protein sequences and functional annotation. UniProtKB/Swiss-Prot provides knowledge on the functional impact and clinical significance of over 30 000 human protein-coding sequence variants, curated from peer-reviewed literature reports. Here we present a pilot study that lays the groundwork for the integration of curated knowledge of protein sequence variation from UniProtKB/Swiss-Prot with ClinVar. We show that existing interpretations of variant pathogenicity in UniProtKB/Swiss-Prot and ClinVar are highly concordant, with 88% of variants that are common to the two resources having interpretations of clinical significance that agree. Re-curation of a subset of UniProtKB/Swiss-Prot variants according to American College of Medical Genetics and Genomics (ACMG) guidelines using ClinGen tools further increases this level of agreement, mainly due to the reclassification of supposedly pathogenic variants as benign, based on newly available population frequency data. We have now incorporated ACMG guidelines and ClinGen tools into the UniProt Knowledgebase (UniProtKB) curation workflow and routinely submit variant data from UniProtKB/Swiss-Prot to ClinVar. These efforts will increase the usability and utilization of UniProtKB variant data and will facilitate the continuing (re-)evaluation of clinical variant interpretations as data sets and knowledge evolve

    Monoubiquitination of syntaxin 3 leads to retrieval from the basolateral plasma membrane and facilitates cargo recruitment to exosomes

    Get PDF
    Syntaxin 3 (Stx3), a SNARE protein located and functioning at the apical plasma membrane of epithelial cells, is required for epithelial polarity. A fraction of Stx3 is localized to late endosomes/lysosomes, although how it traffics there and its function in these organelles is unknown. Here we report that Stx3 undergoes monoubiquitination in a conserved polybasic domain. Stx3 present at the basolateral—but not the apical—plasma membrane is rapidly endocytosed, targeted to endosomes, internalized into intraluminal vesicles (ILVs), and excreted in exosomes. A nonubiquitinatable mutant of Stx3 (Stx3-5R) fails to enter this pathway and leads to the inability of the apical exosomal cargo protein GPRC5B to enter the ILV/exosomal pathway. This suggests that ubiquitination of Stx3 leads to removal from the basolateral membrane to achieve apical polarity, that Stx3 plays a role in the recruitment of cargo to exosomes, and that the Stx3-5R mutant acts as a dominant-negative inhibitor. Human cytomegalovirus (HCMV) acquires its membrane in an intracellular compartment and we show that Stx3-5R strongly reduces the number of excreted infectious viral particles. Altogether these results suggest that Stx3 functions in the transport of specific proteins to apical exosomes and that HCMV exploits this pathway for virion excretion

    The IntAct molecular interaction database in 2012

    Get PDF
    IntAct is an open-source, open data molecular interaction database populated by data either curated from the literature or from direct data depositions. Two levels of curation are now available within the database, with both IMEx-level annotation and less detailed MIMIx-compatible entries currently supported. As from September 2011, IntAct contains approximately 275 000 curated binary interaction evidences from over 5000 publications. The IntAct website has been improved to enhance the search process and in particular the graphical display of the results. New data download formats are also available, which will facilitate the inclusion of IntAct's data in the Semantic Web. IntAct is an active contributor to the IMEx consortium (http://www.imexconsortium.org). IntAct source code and data are freely available at http://www.ebi.ac.uk/intact

    VAMP7 modulates ciliary biogenesis in kidney cells

    Get PDF
    Epithelial cells elaborate specialized domains that have distinct protein and lipid compositions, including the apical and basolateral surfaces and primary cilia. Maintaining the identity of these domains is required for proper cell function, and requires the efficient and selective SNARE-mediated fusion of vesicles containing newly synthesized and recycling proteins with the proper target membrane. Multiple pathways exist to deliver newly synthesized proteins to the apical surface of kidney cells, and the post-Golgi SNAREs, or VAMPs, involved in these distinct pathways have not been identified. VAMP7 has been implicated in apical protein delivery in other cell types, and we hypothesized that this SNARE would have differential effects on the trafficking of apical proteins known to take distinct routes to the apical surface in kidney cells. VAMP7 expressed in polarized Madin Darby canine kidney cells colocalized primarily with LAMP2-positive compartments, and siRNA-mediated knockdown modulated lysosome size, consistent with the known function of VAMP7 in lysosomal delivery. Surprisingly, VAMP7 knockdown had no effect on apical delivery of numerous cargoes tested, but did decrease the length and frequency of primary cilia. Additionally, VAMP7 knockdown disrupted cystogenesis in cells grown in a three-dimensional basement membrane matrix. The effects of VAMP7 depletion on ciliogenesis and cystogenesis are not directly linked to the disruption of lysosomal function, as cilia lengths and cyst morphology were unaffected in an MDCK lysosomal storage disorder model. Together, our data suggest that VAMP7 plays an essential role in ciliogenesis and lumen formation. To our knowledge, this is the first study implicating an R-SNARE in ciliogenesis and cystogenesis. © 2014 Szalinski et al

    The UniProt-GO Annotation database in 2011

    Get PDF
    The GO annotation dataset provided by the UniProt Consortium (GOA: http://www.ebi.ac.uk/GOA) is a comprehensive set of evidenced-based associations between terms from the Gene Ontology resource and UniProtKB proteins. Currently supplying over 100 million annotations to 11 million proteins in more than 360 000 taxa, this resource has increased 2-fold over the last 2 years and has benefited from a wealth of checks to improve annotation correctness and consistency as well as now supplying a greater information content enabled by GO Consortium annotation format developments. Detailed, manual GO annotations obtained from the curation of peer-reviewed papers are directly contributed by all UniProt curators and supplemented with manual and electronic annotations from 36 model organism and domain-focused scientific resources. The inclusion of high-quality, automatic annotation predictions ensures the UniProt GO annotation dataset supplies functional information to a wide range of proteins, including those from poorly characterized, non-model organism species. UniProt GO annotations are freely available in a range of formats accessible by both file downloads and web-based views. In addition, the introduction of a new, normalized file format in 2010 has made for easier handling of the complete UniProt-GOA data set

    Topology of molecular machines of the endoplasmic reticulum: a compilation of proteomics and cytological data

    Get PDF
    The endoplasmic reticulum (ER) is a key organelle of the secretion pathway involved in the synthesis of both proteins and lipids destined for multiple sites within and without the cell. The ER functions to both co- and post-translationally modify newly synthesized proteins and lipids and sort them for housekeeping within the ER and for transport to their sites of function away from the ER. In addition, the ER is involved in the metabolism and degradation of specific xenobiotics and endogenous biosynthetic products. A variety of proteomics studies have been reported on different subcompartments of the ER providing an ER protein dictionary with new data being made available on many protein complexes of relevance to the biology of the ER including the ribosome, the translocon, coatomer proteins, cytoskeletal proteins, folding proteins, the antigen-processing machinery, signaling proteins and proteins involved in membrane traffic. This review examines proteomics and cytological data in support of the presence of specific molecular machines at specific sites or subcompartments of the ER

    The Gene Ontology resource: enriching a GOld mine

    Get PDF
    The Gene Ontology Consortium (GOC) provides the most comprehensive resource currently available for computable knowledge regarding the functions of genes and gene products. Here, we report the advances of the consortium over the past two years. The new GO-CAM annotation framework was notably improved, and we formalized the model with a computational schema to check and validate the rapidly increasing repository of 2838 GO-CAMs. In addition, we describe the impacts of several collaborations to refine GO and report a 10% increase in the number of GO annotations, a 25% increase in annotated gene products, and over 9,400 new scientific articles annotated. As the project matures, we continue our efforts to review older annotations in light of newer findings, and, to maintain consistency with other ontologies. As a result, 20 000 annotations derived from experimental data were reviewed, corresponding to 2.5% of experimental GO annotations. The website (http://geneontology.org) was redesigned for quick access to documentation, downloads and tools. To maintain an accurate resource and support traceability and reproducibility, we have made available a historical archive covering the past 15 years of GO data with a consistent format and file structure for both the ontology and annotations

    Gene Ontology annotations and resources.

    Get PDF
    The Gene Ontology (GO) Consortium (GOC, http://www.geneontology.org) is a community-based bioinformatics resource that classifies gene product function through the use of structured, controlled vocabularies. Over the past year, the GOC has implemented several processes to increase the quantity, quality and specificity of GO annotations. First, the number of manual, literature-based annotations has grown at an increasing rate. Second, as a result of a new 'phylogenetic annotation' process, manually reviewed, homology-based annotations are becoming available for a broad range of species. Third, the quality of GO annotations has been improved through a streamlined process for, and automated quality checks of, GO annotations deposited by different annotation groups. Fourth, the consistency and correctness of the ontology itself has increased by using automated reasoning tools. Finally, the GO has been expanded not only to cover new areas of biology through focused interaction with experts, but also to capture greater specificity in all areas of the ontology using tools for adding new combinatorial terms. The GOC works closely with other ontology developers to support integrated use of terminologies. The GOC supports its user community through the use of e-mail lists, social media and web-based resources
    corecore